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ABSTRACT 

Intrinsic variations of the projected density profiles of clusters of galaxies at fixed mass 
are a source of uncertainty for cluster weak lensing. We present a semi-analytical model 
to account for this effect, based on a combination of variations in halo concentration, 
ellipticity and orientation, and the presence of correlated haloes. We calibrate the pa¬ 
rameters of our model at the 10 per cent level to match the empirical cosmic variance 
of cluster profiles at M 2 oom ~ 10 14 ... 10 15 /i _ 1 Mq, 2r = 0.25 .. .0.5 in a cosmological 
simulation. We show that weak lensing measurements of clusters significantly under¬ 
estimate mass uncertainties if intrinsic profile variations are ignored, and that our 
model can be used to provide correct mass likelihoods. Effects on the achievable accu¬ 
racy of weak lensing cluster mass measurements are particularly strong for the most 
massive clusters and deep observations (with ss 20 per cent uncertainty from cosmic 
variance alone at M 2 oom ~ 10 15 /i _ 1 Mq and 2 = 0.25), but significant also under 
typical ground-based conditions. We show that neglecting intrinsic profile variations 
leads to biases in the mass-observable relation constrained with weak lensing, both 
for intrinsic scatter and overall scale (the latter at the 15 per cent level). These biases 
are in excess of the statistical errors of upcoming surveys and can be avoided if the 
cosmic variance of cluster profiles is accounted for. 

Key words: gravitational lensing: weak - galaxies: clusters: general - cosmology: 
observations 


1 INTRODUCTION 


The largest objects ever formed depend most sensitively on 
small changes in the overall properties of the Universe. It 
is for this reason that clusters of galaxies are a versatile 
probe of cosmology. The number density of clusters and its 
evolution with redshift is influenced by the expansion his¬ 
tory, the density and level of inhomogeneity of matter in the 
Universe and the growth of structures by means of gravita¬ 
tion. Both the parameters of a standard ACDM model and 
deviations from primordial Gaussianity or General Relativ- 
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dividual analyses and I Allen et alll201ll for a recent review). 

The building blocks of such analyses are (i) a clus¬ 
ter catalog with wel l-defined selection function based on, 
e.g. optical, X-ray or lSunvaev fc Zel’dovichl (Il972l . hereafter 
SZ) properties, (ii) a prediction, based on theory or sim- 
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ulations, of how the number density of clusters at given 
mass and redshif t depends on the cosmological parameters 
in question (e.g. [Press fe Schechteilll974l : ISheth fe Tormenl 
119991 : [Tinker et al.ll2008l 'l and (iii) a mass-observable relation 
(MOR) that connects the observable from (i) to the mass- 
based prediction from (ii) in terms of a likelihood. The latter 
must not only describe the mean relation of mass and ob¬ 
servable but, because of the observable limited nature of any 
practical survey, a lso the intrinsic sc atter in the observable 
at fixed mass (e.g. [Lima fe Hull2005h . 

With the advent of large cluster catalogs, our imperfect 
knowledge of the MOR remains the most important limiting 
factor of cluster cosmology. It is difficult to predict or sim¬ 
ulate, to the level of accuracy required, all physical effects 
that influence the observables, particularly ones that are of 
baryonic nature. Thus, one needs to rely on an empirical 
calibration of the MOR. 

The latter can in principle be done by means of 
self-calibration, i.e. by constraining both cosm ology and 
MOR parameters from the cluster catalog alone dHul 12003 : 
iMaiumdar fe Mohrll2004l ; iLima fe Hul 1200a) . However, this 
approach greatly reduces the cosmological power of cluster 
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studies, especially when extensions to the most simple MOR 
and cosmological models are considered. 


It is for this reason that weak lensing studies of clusters 
of galaxies are a particularly powerful complement to clus¬ 
ter cosmology. Being sensitive to all matter independent of 
its astrophysical state, lensing allows, in principle, an unbi¬ 
ased measurement of cluster mass. Several studies have used 
this approach to constrain MORs with lens ing mass mea- 
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of further improving statistics with on-going and future large 
surveys. 


However, for accurate constraints on the mean MOR 
and its intrinsic scatter to be derived, one needs to ensure 
that the mass likelihood from a lensing analysis includes 
all actual sources of uncertainty. For typical cluster lensing 
studies, the dominant uncertainty is observational, i.e. based 
on the limited number of intrinsically elliptical background 
galaxies. Since all structures between sources and observer 
cause a lensing signal, an additional, irreducible uncertainty 
results from the variance of the matter densit y along the 
line of sight (LOS; cf. e.g. iHoekstrallioOll . l2003h . Most clus¬ 
ter lensing studies include either only the first or both of 
these effects when fitting the signature of a mass dependent 
density profile to observed data. 

What this neglects, either entirely or apart from a low¬ 
dimensional approximation in terms of e.g. halo concentra¬ 
tion, is the complexity of cluster profiles. The projected den¬ 
sity profiles of two clusters at the same virial mass can differ 
greatly, as could be described accurately only with detailed 
information about the shape and orientation of their central 
dark matter halo and the positions and masses of subhaloes 
and neighbouring structures. Due to its relatively low res¬ 
olution, no weak lensing analysis is able to uncover all or 
even most of this information. However, the uncertainty of 
mass measurement will depend on the statistical properties 
of these variations. 


The goal of this work is to provide a model for the 
variations in projected density profiles of cluster of galax¬ 
ies at fixed mass. We construct our model using analytical 
templates for the expected scatter due to variations in halo 
concentration, ellipticity and orientation, and its substruc¬ 
ture and cosmic neighbourhood. We re-scale these templates 
to match the empirical variations of cluster profiles in a cos¬ 
mological simulation, where both well-defined true masses 
and the lensing signal without any observational noise are 
known. We then use this semi-analytical covariance model 
to test and predict the accuracy of lensing measurements of 
cluster mass and MORs, and to assess the effect of neglect¬ 
ing intrinsic variations. 

The structure of this paper is as follows. In Section[2]we 
briefly describe the simulations used. Section [3] defines the 
components of our model for the mean cluster profile and, in 
particular, its intrinsic covariance. In Section U we explain 
how the model is fitted to the simulated data. Effects of the 
intrinsic covariance on weak lensing cluster mass measure¬ 
ments and MOR studies are shown in Section [5] before we 
summarize in Section [6] 


Conventions 


Our calcul ations are presented in a flat WMAP 7 year 
cosmology dKomatsu et al.l l201lTl with (Q m ,Qb,crs,h,n) = 
(0.27,0.044,0.79,0.7,0.95), in consistency with the simula¬ 
tions used. The covariances can be readily re-scaled to dif¬ 
ferent sets of cosmological parameters, up to a potential cos¬ 
mology dependence of our model parameters. We explicitly 
write appropriate factors of h = Ho/ (100 km s -1 Mpc -1 ) 
where applicable. We denote the radii of spheres around the 
cluster centre with fixed overdensity as r Am, where A = 200 
is the overdensity factor of the sphere with respect to the 
mean matter density p m at the cluster redshift. The mass 
inside these spheres is labelled and defined correspondingly 
as -Ax a mP m ‘ 

We use a linear matter power spectrum Pu n (k,z) = 
D 2 (z)Pu n (k, 0) wi th the transfer function model including 
baryonic effects of lEisenstein fe Hr] dl998l l and normalize to 
our value of erg. For the linear growth fa ctor D(z), normal¬ 
ized to -D(O) = 1, we use the expression of lPeacock fc Doddsl 
(jl99d l. We define the corresponding linear matter two-point 
correlation as 


6 in(r,z) =2 ^ f dkk 2 PUk,zf-^j^ . ( 1 ) 

For the scaling of some of the cluster properties we will 
use the common definition of peak height 


v = S c /a(M 200 m,z) , (2) 


where 8 C = 1.686 and 



dk k 2 Pn n (fc, 0) 


w 


cy 2 (M 2 oom,z) = D 2 (z) 
( ( 3M200m ^ ^ ^ 

) ' 


( 3 ) 


with the Fourier transform of the top-hat window function 
of radius r, w(k,r). 


2 SIMULATIONS 

We u se the simulation labeled L1000W in iTinker et al.l 
d200Sl l. It consists of 1024 3 dark matter particles of mass 
6.98 x 10 10 /i _1 Mq in a box of comoving size 1 h~ 1 Gpc, 
simulated with the parallelized Adaptive Refinement Tree 
algorithm dKravtsov et al.|[l997l : lGottloeber fe Klypinll2008l ~) 
from redshift a = 60 to 2 = 0, at an effective spatial resolu¬ 
tion of 30/i _1 kpc. In a snapshot at redshift z = 0.24533 
almost 15,000 haloes at 0.95 x 10 14 /i _1 Mq ^ M 2 oom ^ 
1.5 x 10 15 /i _1 Mq are identified. 

_ The same haloes are also used in iBecker fe Kravtsov! 

(l201lll and we employ the lensing maps computed in that 
work. For the selected haloes, the mass is integrated along 
the LOS and the lensing signal is determined on a grid of 
approximately 40 comoving kpc/pixel using the Born ap¬ 
proximation as described in IBecker fe Kravtsov! d201 ll . Sec¬ 
tion 3). All matter within comoving ±200/i~ 1 Mpc along the 
LOS and, transversely, in a square with comoving side length 
20/i _1 Mpc centered on the cluster is included in the calcu¬ 
lation. Background sources are assumed to be at a constant 
redshift z s = 1, where E cr it = 4.22 x 10 15 /iMqMpc -2 . The 
density map inside a clustrocentric radius 9 ^ 1' is subject 
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to resolution effects, which is why we discard it from our 
analysis. 


3 MODEL DEFINITION 

In the following, we introduce our model for the covariance of 
projected cluster profiles. Its components, namely the model 
for the mean profile (Section 13.111 and the contributions to 
the covariance ('Section 13.21) . are described subsequently. 

Consider the convergence profile of a given cluster with 
mass M 2 oom =: M as 

K = k{M) + E , (4) 

where K is the observed convergence profile and k(M) is 
the mean convergence profile of clusters of fixed mass M. E 
is the residual between the two, which arises due to observa¬ 
tional uncertainty and variations of the noiseless observable 
k profile at fixed mass. These variations could be due to un¬ 
correlated structures along the LOS or the projected cluster 
density profile itself. The latter intrinsic variations are the 
focus of this work. 

The profiles are vectors measured in a system of radial 
bins, where bin 1 < i < n is defined as 9 £ [d i) m i n , 9 it ma x] 
without gaps, i.e. 0i, ma x = 

Approximating the residuals E as a multivariate Gaus¬ 
sian with zero mean, one can write the likelihood for an 
observed K given a mass M as 

C(K\M) = 1 = 

det(C(M)) 

x exp(-i E t C-\M)E^ , (5) 

where we have introduced the covariance C of residuals E. 
This n x n matrix is defined as 

Cij =Cov(E i ,E j ) . (6) 

We make a semi-analytic ansatz for a parametric model 
of C as 

C{M) = C oha + C LSS + c conc (v)C conc (M) 

+ c COII C)C COII (M) + c ell O)C ell (M) 

(7) 

with contributions from observational uncertainty (i.e., 
shape noise in shear measurement and Poisson noise in mag¬ 
nification studies) C obs , uncorrelated large scale structure 
along the LOS (7 LSS , scatter in halo concentration (7 conc , 
correlated secondary haloes near the cluster C corr and vari¬ 
ations in halo ellipticity and orientation C el1 . The terms (7 obs 
and C LSS are independent of the cluster itself. In contrast to 
the common case where only these are considered, however, 
C(M) inherits a mass dependence from the latter terms, 
which we will call intrinsic variations of the cluster profile. 
They are described in detail in Section r3.2l The empirical re¬ 
scaling factors c*(iy) of C COTT , C el1 and (7 conc are determined 
by fitting the covariance model to the simulated haloes (see 
Section [1}. 


3.1 Mean profile 

Several studies in the past have proposed functional forms 
for the three-dimensional density of dark matter haloes. 


iNavarro. Frenk, fe Whitel d 19971 . hereafter NFW) found that 
haloes are well-fit by the two-parametric broken power-law 
profile 


PnfwM 


_Po_ 

(r/r a )(l +r/r a ) 2 ’ 


( 8 ) 


with scale density po and a scale radius r a that can be 
expressed as a fraction of r 2 oom using the concentration 
C200m = r200m/r s . 


Two additional effects need to be accounted for in a 
realistic description of halo density profiles. Firstly, the en¬ 
closed mass of an NFW profile diverges logarithmically as 
r —» oo. The density must therefore be truncated at large 
radii, for w hich a number of approaches have been pro¬ 
posed (e.g. Thkada^_Jain|_ 20031 : lHavashi fe Whit j 120081 : 
iBaltz. Marshall, fe Qgurill2009l . hereafter BMO). Secondly, 
the halo is embedded in an overdensity of correlated matter 
that contributes to the overall projected profile at all, and 


dominantly at large, ra dii. We closely follow the work of 
lOeuri fe Hamanal (1201 11 1, who fit a superposition of a BMO 
profile with a linear two-halo term to simulated haloes, i.e. 


E(M) = k{M) E cr it = Elh,BMo(M) + E 2 h,lin(M) . ( 9 ) 

We describe the one-halo (Section [370} and two-halo term 
(Section l3. 1.211 in the following. 


3.1.1 One-halo term 

BMO define a truncated version of the NFW profile as 
/ 2 \ P 

PbmoW = PnfwM x ( —- —-- j , (10) 

\ ( r / r 200m ) + r 200m J 

where we use j3 = 2. Here we have introduced T 2 oom, the mul¬ 
tiple of r 2 oom around which the halo is smoothly truncated. 
The BMO profile can be analytically integrated to yield the 
projected profile Eih.BMO, as we ll as the change in mass nor¬ 
malisation du e to missing density rela tive to the NFW pro¬ 
file (cf. BMO; lOeuri fe Hamanall201ll and the ample code, 
http://kipac.stanford.edu/collab/research/lensing/ 
ample/ample. c). 

Besides mass, our one-halo term has two additional free 
parameters, the concentration C 200 m and truncation radius 
T 2 oo m . We fix the fo r mer w i th the mass-concentratio n re¬ 
lation of lDuffv et al.l (l2008l l. iQguri fe Hamanal d201 ill note 
a weak dependence of TA vlr m on mass. Our scaling rela¬ 
tion of t 200 m is determined by fitting the mean k profile 
of haloes in our simulation (cf. Section [5} in logarithmic 
bins of A log 10 M 2 oom = 0.05 in the radial range of 3 to 15 
arcmin using the full profile of eqn. O with rasa free pa¬ 
rameter. The uncertainties for our x 2 minimization w.r.t. r 
are derived from the ensemble variance of mean k in the re¬ 
spective mass bin. We find that the relation is well described 
as a linear function of v. Our fit in v £ [2.2, 3.7] yields 

f 3.85 - 0.73i/ ix ^ 3.7 

T2oom - | L15 i* > 3.7 ^ ’ 

where we have assumed a constant value for T 2 oom above the 
peak height range covered by our simulated haloes. 

We note that the fact that more massive haloes are, 
on average, truncated at smaller multiples of their r 2 oom 
radii could be interpreted in terms of the steepening of the 
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de nsity slope with increasing mass accretion rate observed 
by iDiemer fe Kravtsov! (12014 ). 


3.1.2 Two-halo term 


The two-halo surface mass density due to linear evolution 
arou nd a cluster of mass M at redshift 2 can be written as 
(e.g. lOeuri fe Hamanall201ll 'l 


£ 2 h,iin(0) = bh{M, z) p c fi W(6, z) D a 2 (z) 


( 12 ) 


Here, bh(M,z) is the line ar bias of the clust er halo (which 
we calculate according to lTinker et al.lboid l and Da(z) is 
the angular diameter distance to redshift 2 . We have defined 
the projected linear excess Lagrangian depth W (6, z) as 


W(0,z) = [D a {z){1 + z 

r +ac 


'—AC 

f l — 

/ 27T 


6in ( ^\DAz)G+z)&fTG, *) dC 


Jo(W)Plin 


l 


(1 + z)D a {z) ’ 


(13) 


where ±A£ is the interval in comoving distance along the 
LOS integrated in our simulated projected density profiles. 
For large enough A£, almost all correlated matter is included 
in the integration and the latter equality holds. In eqn. 1131 
we have used the Bessel function of the first kind and order 
zero, Jo- W has units of comoving volume per solid angle 
and is readily interpreted as the excess in Lagrangian volume 
per solid angle that has moved to the projected vicinity of 
a structure with unit bias due to linear evolution. 

The mass measurements of our haloes from the simula¬ 
tions contain all particles within r 2 oom- We therefore need 
to correct for contributions of two-halo matter M 2 h when 
defining the mass of the one-halo profile, which we do at 
first order as 


M 200 m,lh = AL200m — M2h{r200m, M200m, z) . (14) 

Inside a sphere of radius r around the cluster centre, the 
mass of two-halo matter is 


M 2 h{r,M,z) = b h (M, z) p cfi Q m ,o U(r, z) . (15) 


Here we have defined the linear excess Lagrangian volume 
inside a sphere of radius r as 


r(l + z) 


U(r,z) = 


dr-'^n n {r',z) 4nr' 


= D 2 (z)U(r{l + z),0) . 


(16) 


3.1.3 Comparison to simulations 

Figure |T] shows model and mean profiles of simulated haloes 
in two mass bins spanning most of the dynamic range of our 
halo catalog (see Section 0 for details on the simulations). 
The model fits the data well at projected r > 0.3 x r 2 oom, 
but moderately overestimates projected density at smaller 
radii, where an un-truncated NFW profile without two-halo 
contribution is a better fit. Potential reasons for the dis¬ 
crepancy include factual deviations of the simulated dark 
matter haloes from the NFW profile at small radii (cf. 
iBecker fe Kravtsovll201l1 . their Fig. 2, for an analysis based 
on the same cluster sample), resolution effects at small 


radii and the simplified nature of our linear superposition 
of the collapsed halo pr ofile with a linear two-halo term (cf. 
lHavashi fe Whit3l2008l for a different approach, in which the 
three-dimensional density is assumed to be piecewise equal 
to the NFW one-halo or a linear two-halo term, only). Since 
we only use the model for re-normalization of the ensemble 
mean profile to the mass of a given cluster (cf. Section T3. 3D . 
this is not problematic for our purposes. 


3.2 Components of profile covariance 

We have made the ansatz for the residual between a given, 
noise-free cluster k profile K and the mean profile at its 
mass k(M) as a sum of multivariate Gaussian vectors corre¬ 
sponding to four distinct physical effects: uncorrelated large 
scale structure along the LOS, variations in concentration, 
halo asphericity and orientation, and correlated haloes in 
the cluster substructure and neighbourhood (cf. eqn. [7]). All 
of these effects have the potential to bias the lensing mass 
measurement or, equivalently, can be interpreted as a source 
of intrinsic covariance between the components of K — k(M) 
that causes additional intrinsic noise on a cluster-by-cluster 
basis. In this section, we provide analytic expressions for all 
four components of the intrinsic covariance. 


3.2.1 Uncorrelated large-scale structure 


For a given source population, random structures along the 
LOS introduce a covariance in convergence measured in an¬ 
gular bins, which can be written as (e. g. [Schneide r et al.l 
Il998fc lHoekstrall2003l : lUmetsu et al.|[20ll] l 

c ff s = J ■ (i 7 ) 


Here, Jo{ldi) is the area- weighted average of J o over annu- 
lus i. Using the identity (lAbramowitz fe Stegunlll965l . their 
eqn. 9.1.30) 


(~) [z l Ji{z)]=z l ~ k J,- h (z). (18) 

with k = 1 = 1 and integrating yields (cf. also lUmetsu et ahl 
1201 ll . their eqn. 16) 

Jo{l0i) = -^—[{l + d)Ji(Wi(l + d))-(l-d)Ji(Wi{l-d))] , 

Jjdit/i 

(19) 

where we have assumed an annulus of width 2 d6i from (1 — 
d)0i to (1 + d)6i. 

The convergence power spectrum P K in eqn. [IT] is ob¬ 
tained from_the matter power spectrum by means of the 
iLimbed (1 19541 1 approximation as 


P K (l) 


9 H 2 n 2 m f X2 , 

—/ dx 


f Xs-X. 

\Xsa(x) 


2 

Pnl(l/X,X) , 


( 20 ) 


where we have assumed a fixed comoving distance of the 
sources Xs- F° r calculating U LSS around our cluster haloes, 
we set the source redshift to a fixed z s = 1 and limit the 
interval [xi>X 2 ] to the range of ±200/i -1 comoving Mpc 
included in the boxes around the cluster haloes (cf. Sec- 
tion[5|. For t he non-linear matter power sp ectrum P n 1 we use 
the model of ISmith et al.l d2003l l with the lEisenstein fc Hul 
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Figure 1 . Mean convergence profiles of haloes at the low-mass (left) and high-mass (right) end of our sample, shown for haloes at the 
simulation redshift z = 0.24533 with corresponding angular (bottom axis) and r 200 m (top axis) scales. Black lines show average profiles 
in the simulations, with error bars giving the uncertainty of the mean. Our model is shown as the red, solid line, composed of the BMO 
one-halo term (red, dotted) with truncation radius according to equation [TT] and the two-halo term (red, dashed). The blue, dotted line 
shows un-truncated one-halo NFW profile for comparison. 


fil998i ) transfer function including baryonic effects. We per- 
form the integral us ing a customized version of NICAEA 
dKilbinger et al.ll2009l ). The result is shown in the lower left 
panel of Fig. [2] 


3.2.2 Concentration 

The covariance due to variations in concentration parameter 
at fixed mass is calculated as 


( - x ij 


d P(c)KiKj 


dP(c)Ki 


d P(c)Kj 


( 21 ) 


where Ki is taken to be the convergence of a halo of 
concentration c in annulus i. For the probability density 
dP(c ), we assume a log-normal scatter of oi ogl0 c 20 om = 
0.18 feullock et al.ll200ll) about the mean concentration of 
iDuffv et al. ( 20081 ). The result is shown in Fig. [5] lower 
second-to-left panel. As can be seen, it is subdominant at 
the radii considered here relative to other sources of intrin¬ 
sic profile variation, at least for clusters of moderate mass, 
which illustrates the difficulty of measuring individual clus¬ 
ter concentrations with weak lensing. 


the stochastic variation of the number of correlated haloes 
around a cluster also contributes to the covariance of n. 

We calculate t his effect in a halo model, in analogy to 
iGruen et al. 1 (honl. their Appendix A2). The idea is to split 
the set of all possible haloes into subsets that are alike in the 
sense that they cause a similar shear signal, e.g. ones that 
are of similar mass and projected distance from the cluster. 
We then apply Poissonian statistics to the number of haloes 
that is actually present from each subset. 

Let H = {h} be the set of tuples h that completely char¬ 
acterize all possible halos, e.g. in terms of their coordinates 
and masses. Consider mutually exclusive and collectively ex¬ 
haustive subsets Hk C H that are small enough such that 
MPtk : Vh a ,hb £ Hk : S ‘ l (h a ) ss is a good approxi¬ 

mation for the surface mass density of such a halo averaged 
over annulus i. Then we can write the surface mass density 
of correlated haloes E(, orr in annulus i as 

K OI r= ^^(hk) ( 22 ) 

k 

where A k ^ 0 is a random variable that describes the 
number of correlated haloes £ Hk realized in a manifes¬ 
tation of the cluster. Defining a halo excess probability den¬ 
sity dP c around a cluster with properties h c i such that 
f Hk dP c (h\h ci) — (\k) , this immediately results in the ex 
pectation value 


3.2.3 Correlated large-scale structure 

Clusters of galaxies are likely to form in overdense regions, 
where the abundance of additional (correlated) haloes is 
also higher than average. The mean effect of this on pro¬ 
jected density is the two-halo term of Section f3.1.2l However, 


(SLrr) = j dP c (h\h cl )Z\h) . (23) 

For the covariance we make the assumption (discussed 
below) that the population random variables A k and A; of 
the halos in sets Hk and Hi are Poisson distributed and 
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Figure 2. Illustration of our model for the profile covariance. Top left panel shows the empirical intrinsic covariance of k , profiles between 
1 and 30 arcmin of simulated clusters of mass M 200 m — 1.8... 2.2 x 10 14 Mq. Our model prediction for the covariance is shown in the 
top right panel. It is a linear combination of the components shown in the bottom panel. Even at this relatively low mass, uncorrelated 
large-scale structure (left in lower panel) is an insufficient description of the observed covariance, especially in the inner regions. We 
model the additional covariance as a linear combination of variation of the concentration parameter, the covariance of correlated haloes 
and halo ellipticity and orientation (remaining plots in lower panel). All panels use the same color scale with contour lines spaced by 
Alog 10 Co vk = 0.25. At the mass plotted here, c corr ( 7 corr j s the dominant component of the intrinsic (co)variance on most scales, with 
significant contributions from c ell C e11 in the innermost region and subdominant concentration scatter. We note, however, that the relative 
importance of these three components changes as a function of mass (cf. Fig. |3|>. 


mutually independent (Cov(Afc, Xi) = SkiXk)- Then 

ctr x sLt = J dP c (h\h cl )^(h)J: j {h) . (24) 

If we characterize secondary haloes only by their mass 
and projected angular distance from the cluster centre, h = 
(M, 9), we can write 

dP c (h\h cl ) = b(M cl )b(M) dN d ^y^ W(0, z c i) 2ir9d9dM , 

(25) 

Here we have introduced the halo mass function 
and used the projected linear excess Lagrangian depth W 
from eqn. 1141 which assumes that pairs of haloes of masses 
Mci and M cluster according to the linear matter two-point 


correlation and a mass dependent linear halo bias b as 

£hh(M c i, M, r ) = b(M c i, z cl )b(M, z c i)£iin(r, z c \) ■ (26) 

We perform the two-dimensional integral of equa- 
tions !24l and l25l factoring out the cluster halo bias b(M c i, z c i) 
for later re-scaling of the covariance matrix to clusters of any 
mass. For the halo mas s function an d halo bias we use the 
models of I Tinker et al.l d200sl . 120101 ). The haloes are mod¬ 
elled as BM P profiles (cf . Sectio n f3.1.1l) with concentration 
according to iDuffv et al.l (l2008l 4 and the truncation radius 
model of Eqn. [TT] We include contributions from haloes of 
mass 10 8 /i _1 Mq ^ Mioom < 10 15 ' 5 /i~ 1 Mq. 

Finally, we note that a mutual (three-point) correlation 
of different correlated haloes is in fact expected, i.e. the pres¬ 
ence of one massive, secondary halo makes the presence of 
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tertiary haloes more likely. This means that the approach 
presented above yields merely a lower limit of the true co- 
variance, yet this can be approximately compensated by an 
empirical re-scaling of the covariance matrix with c corr . The 
resulting covariance matrix is shown in the lower second-to- 
right panel of Fig. [3] 


Non-linearly correlated subhaloes 


The model for correlated large-scale structure in Sec¬ 
tion 13.2.31 assumed a linear correlation of secondary haloes 
at all radii. A more realistic approach might be to make this 
assumption only for haloes outside the virial radius and add 
a distinct population of subhaloes. We test this model as 
follows. 

For the subhalo a bundance inside the virial radius 
dBrvan fc Normarl 1998|, their eqn. 6) we use t he subhalo 
mass function of Jiang fc van den Boschl d2014lh The sur¬ 
face density of haloes is distributed according to an NFW 
profile with concentration C 2 oom — 3.9, adapted fr o m the 
me asurement o f C 2 oo e = 2.6 by iBudzvnski et al.l d2012l l 
for iDuffv et al.l d2008l J halo concentration, M 2 oom = 2 x 
10 14 /i -1 Mq and the snapshot redshift. These assumptions 
yield d P c for the subhalo case. 

We assume a truncation of sublialoes at their r 2 oom ra¬ 
dius, i.e. T 2 oom = C 200 m(M su b), motivated by halo stripping. 
For the projected density profile of the individual subhalo, 
we ensure mass compensation by subtracting as much mat¬ 
ter according to the mean subhalo density profile (eqn. 1231) 
as is contained in the individual halo. Otherwise we apply 
the prescription of Section 13.2.31 to determine the subhalo 
shot noise covariance matrix C' aub . In this we are ignoring, 
as before for the linear correlation, both a potential depen¬ 
dence of the subhalo number density profile on the parent 
halo concentration and ellipticity (which might cause corre¬ 
lations between C sub and the respective other components 
of the model) and mutual correlation of subhaloes. 

The covariance for linearly correlated haloes in this 
model is calculated as in eqn. I2J1 yet using a virial sphere 
excised version of W(9) to calculate dP c (h\h c \) in eon. 1251 
Note that C corr in this model explicitly depends on the halo 
mass rather than being a constant template that is re-scaled 
by the central halo bias. 

We find that the predicted subhalo covariance is a sig¬ 
nificant contribution to the intrinsic covariance of the halo 
profile at the high mass end of our sample. The contribution 
is dominated by massive subhaloes that are sufficiently re¬ 
solved by our simulations. When we fit a linear dependence 
of a re-scaling parameter c Bub = Co ub + Ci ub (iz — j/Q Ub ) to¬ 
gether with Cq/ 7 and c'fj 1 as described in Section [I] below, 
we find c B 0 ub = 0.8 ±0.8 and cf ub = -2 ±1.5. Fixin g c| ub = 0 
and Ci° rr = 0 yields a best-fit Co ub = 0. The contribution of 
subhaloes is particularly degenerate with the halo aspheric- 
ity covariance C el1 (see Section 13.2.41) . We conclude that a 
larger sample of massive haloes would be required to deter¬ 
mine whether the addition of (7 sub improves the model and 
therefore do not include it in the following analyses. 


3.2.4 Halo asphericity 


Dark matter haloes are known to be triaxial in general. The 
axis ratios and their orientation along the LOS can change 
the projected k profile significantly. It is difficult, however, 
to model the effect in its full generality. 

We therefore make the following simplified model for 
the covariance due to halo asphericity. In accordance with 
the dominant feature of halo shapes in nature, we assume 
them to have a prolate shape with minor-to-major axis ratio 
0 <q = b/a = c/a^ 1. We define a coordinate system with 
ellipsoidal radius r e such that 

0 0 \ 

q~ 2/3 0 * , (27) 

0 q 4/3 ) 

where x T = ( x, y, z ) is a Cartesian coordinate system cen¬ 
tred on and aligned with z along the major axis of the halo. 
For the spherical coordinate system ( R, 9, (j>) with 9 = 0 
along the major axis, 

r e (R, 9)/R = yjq~ 2 / 3 sin 2 9 ± q 4 / 3 cos 2 9 . (28) 



We find the three-dimensional density of a prolate NFW 
halo by evaluating equation 0 at r = r e . By virtue of 
the unit determinant, the volume of ellipsoidal shells is 
V(r e ,r e ± dr e ) = d-irr^dr e . The usual normalization of 
po therefore means that the density integrated out to an 
ellipsoidal radius r e = r 2 oom matches M 2 oom- However, 
we need to re-scale po numerically for the mass inside a 
sphere of radius r 2 oom to match M 2 oom (cf . I.Iing fc Sutol 
120021 : lOeuri et al.ll2003l : ICorless fc King|l2007l for definiti ons 
based on ellipsoidal overdensity and Dietrich et al. 1 120141 for 
a spherical overdensity approach to ellipsoidal haloes). 

Integration of p along the LOS yields the surface mass 
density for any combination of M 2 oom, 0200 m, q and orien¬ 
tation angle a , where a = 0 puts the major axis along the 
LOS. We take care to use an approximation to the analytical 
result for the integrated density inside the innermost region, 
r/r s < 10~ 3 , where the diverging density leads to numerical 
instability. Here we can approximate 9 ss a for most of the 
matter along the LOS to find the mean surfac e density in¬ 
side a small projected radius 8 v = 5r/r a as fcf. iBartelmannl 
ll996l : IWright fc Brainerdll2000l ') 


(E)(< Sv ) = p 0 r s 
x ((1 - e 2 )-V3 sin 2 Q + (1 _ e 2)2/3 cos 2 ^ -1/2 

x ji(7r=1j5" c *‘ ,,h (VT^) +1,, (T)) • (29) 

The covariance matrix is integrated as 


Cfj = / dP(q, cos a)KiKj 


d P(q, cos a)Ki 


d P(q, cos ot)nj 


(30) 


where is the mean surface density of a halo of axis ratio 
q and orientation angle a in annulus i. We assume isotropic 
orientation, i.e. a uniform distribution of cos a £ [0,1], and 
a truncated Gaussian distribution of q 


P(q ) °c 


Af(p = 0.6, a = 0.12), 0.1 
0, q < 0.1 Wq > 1 ’ 1 ’ 
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approximating the distribution of q for haloes of mass as 
10 14 Mq as measured in simulations by iBett et all J2007I . 
cf. their Section 4.3). The resulting covariance matrix, re¬ 
scaled empirically by c conc , is shown in the lower right panel 
of Fig. [2] We see that the influence of halo ellipticity is 
limited to small radii compared to correlated and uncorre¬ 
lated structures along the LOS, yet quite significant in that 
regime. 


3.3 Covariance estimation 

Given a model for /t(M 2 oo m ) that we can subtract from the 
observed K, any simulated cluster of mass Afeoom yields an 
estimate for C as 


Co v(Et, Ej) = (EtEj) . (32) 

While this approach heavily relies on the accuracy of k, it is 
also possible to estimate the covariance matrix C(M 2 oom) 
from an ensemble of N simulated clusters of fixed mass 
Mioom without assuming a model for the mean profile. This 
is done by applying the ensemble covariance estimator 

Cov(E i ,E j ) = 1 ^- [ {E i E j ) , (33) 


where E is the residual with respect to the ensemble mean 
profile. The variance of the estimator of eqn.[33]is larger than 
the one of eon. 1321 bv a factor N/(N — 1). We decide to bin 
our clusters in mass in subsamples of N = 24, such that the 
loss of information due to not assuming a model for the true 
mean profile is negligible. However, especially at the massive 
end where the number of clusters in our simulations is small, 
this would introduce an additional variance of profiles due 
to the systematic change of mass within each subsample. We 
therefore define 


N 


N 


— Ut - n- 1 K 'i 


(34) 


where Kl and are the actual and model convergence of 
cluster j in radial bin i. Here, we have re-scaled each of 
the other cluster profiles to the expected value at the mass 
of cluster k by means of the model and amplified the de¬ 
viations from the mean by \JN/(N — 1) to correct for the 
bias of the maximum-likelihood ensemble variance, such that 
Co v(Ei,Ej) = Cov(Ei, Ej). 


4 DETERMINATION OF COVARIANCE 
MODEL PARAMETERS 

The multivariate Gaussian of equation [3] corresponds to a 
log-likelihood 

— 2 ln£ = lndet C(M) 

+ (k(M) - K) t C~ 1 (M)(k{M) - K) 

+ const . (35) 

This could be interpreted as a likelihood of mass or of the 
parameters of both the mean profile model n(M) and the 
covariance model C(M). For the latter, the first term on 
the right-hand side serves as a regularization that prevents 
run-off of the (co)variance estimate to infinity. In our case, 
as discussed above, we use a non-parametric profile model 


to replace n(M) — K by the E of equation [31] and maximize 
the likelihood to constrain the parameters of the covariance 
model. 

Equation [33] has the disadvantage that the precision 
matrix C~ 1 (M) and det C(M) become numerically unstable 
due to the large conditional number and strong covariance 
between neighbouring radial bins. Related to this, the uncer¬ 
tainty of empirically estimated off -diagonal compone nts of 
the covariance matrix is large (e.g. iTavlor et al .1120131 . their 
eqn. 18). We therefore decide to use equation 1351 however 
with a diagonal model covariance matrix, i.e. one where all 
off-diagonal components are set to zero. We verify, using a 
toy model, that this yields an unbiased maximum-likelihood 
estimate of the covariance model parameters. 

For the covariance model, we assume equation [7] 
The contribution of concentration becomes important only 
at large mass and small radii, and rather than (poorly) 
constraining i t from the data we decide to adopt the 
[Bullock et al.l ll200lh log-normal concentration scatter, i.e. 
we set c conc = 1. 

As a baseline for c corr (i/) and c ell (i^) we use the mass- 
independent model 

corr/ \ corr 

c (v) = c = const 

c ell (v) = c el1 = const . (36) 

As a test for mass dependence we also run our analysis with 
a model that allows for linear evolution in u, 

c (v) = Co +{v-v 0 )Ci 

c e V) = cf + (i/ - vf)cf , (37) 

where we set the pivot points Vq° tt = 2.5 and v d” == 2.8 to 

make errors on Cq and c\ approximately uncorrelated in the 
bootstrap runs. 


4.1 Results 

The best-fitting values for the parameters of eqn. [33] with 
errors estimated from bootstrapping are c corr = 5.0 ± 0.3 
and c el1 = 3.7 ± 0.3. Allowing for a linear dependence as 
in eqn. cm these values are unchanged at best fit and we 
get no indication (but relatively poor constraints) for a v 
dependence with Cj orr = —0.04 ± 0.88 and Ci® 11 = —0.05 ± 
0.79. For the remainder of the analysis, we therefore adopt 
v independent c corr = 5.0 and c el1 = 3.7. 

Figure [3] shows the full covariance model with these pa¬ 
rameters for M 2 oom = 2x10 14 /i _1 Mq. Figure[3]shows model 
and data variance for a range of masses. The model repro¬ 
duces observed intrinsic variations well over a wide range 
of mass. We note, however, the large uncertainty in data 
variance due to our limited sample size. 


4.2 Redshift dependence 

In order to test for a potential redshift dependence of our 
model parameters, we repeat the analysis with a second 
snapshot at z = 0.499, using 788 haloes at M 2 oom ^ 
2 x 10 14 /i _1 Mq. 

The best-fitting truncation radii are consistent with 
eqn. 1111 We fit the parameters of eqn. [33] as in Section [3] 
and determine errors using bootstrapping. Empirical covari¬ 
ances are well reproduced by a model with c corr = 4.2 ±1.1 
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Figure 3. Empirical covariance (black, left panel) and covariance model (red, left panel) for clusters of mass M 200 m = 1.8... 2.2 X 
10 1 1 h 1 Mq (solid lines), Aftoom = 3. 5... 4. 5 X 10 14 /i _1 tfg (dotted) and . 1 / 200 ™ = 7... 9 X 10 14 h ~ 1 Mq (dashed lines). Right panel 
shows components of the model with c corr = 5.0 and c el1 = 3.7 for all three cases (uncorrelated LSS inside the snapshot box shown as 
grey, dotted-dashed line). 


and c el1 = 3.8 ± 0.9. Due to the much smaller sample size 
and lower contrast of the haloes relative to LSS noise, the 
parameters are significantly less well constrained as in the 
2 = 0.245 snapshot, however, consistent within the uncer¬ 
tainties. 

We conclude that at the present level of statistical 
certainty the best-fit values of Section 14.11 can be used 
over the most relevant redshift range for cluster lensing of 
2 « 0.2... 0.5. 


5 EFFECT ON WEAK LENSING CLUSTER 

SURVEYS 

In this section, we estimate the influence of intrinsic covari¬ 
ance of density profiles at fixed mass 

• on mass measurements of individual clusters (Sec¬ 
tion ehj 

• and on the determination of parameters of the mass- 
observable relation (MOR) in a lensing follow-up of a sample 
of clusters (Section 15.21) . 

In our analyses we fix the lens redshift to Zi = 0.24533, 
the primary redshift of our simulated cluster profiles. We 
consider shear surveys of different depths, parametrized by 
the surface density n of source galaxies, which we assume to 
lie at a fixed z s = 1. The three settings chosen roughly 
correspond to current large ground based surveys (n = 
10 arcmin -2 ), the best available ground-based data (n = 
50 arcmin -2 ) and space-based data (n = 100 arcmin -2 ). 

The full covariance of equation [7] contains, apart from 
uncorrelated and intrinsic variations of surface mass den¬ 
sity, the measurement uncertainty of the survey G obs . For 
a shear survey, assume that we can measure the tangential 


gravitational shear 7 in each annulus i with variance 


2 


a 


'Yj® 



(38) 


where <r e « 0.3 is the shape noise (including intrinsic 
shape dispersion and measurement noise), n is the back¬ 
ground source density and Ai the area of annulus i. The 71- 
dimensional covariance matrix of 7 (ignoring intrinsic align¬ 
ment and shear systematics) is diagonal with G7 bs ’ 7 = cr 2 ^. 

In the limit that annuli are thin, 7 and ft are connected 
by a linear equation 7 = Gk, where the n x n matrix G is 
defined such that 


7 * = («><; ~ ^ - ' 


(39) 


3=0 


Our goal is to calculate the observational covariance C° 3 of 
ft, which can be written as 

C obs = G - 1 C ,ob 3 ’ 7 (G -1 ) T . (40) 


The above derivation has omitted the technical step of 
breaking the mass-sheet degeneracy, which is necessary for 
G to be of full rank. To this end, we increase the dimension¬ 
ality by 1 and let the newly introduced entry 7 n+i represent 
the convergence in the outermost radius, and the new ft n +i 
be the mean convergence inside the innermost annulus. The 
measurement error of 7 n+i is defined to be the LSS vari¬ 
ance of ft on the outermost angular scale (this is essentially 
what happens in the common assumption of ft = 0 on large 
scales). The mean convergence inside the innermost annulus 
is connected to 7 by the linear equation above. We later ex¬ 
clude the added component from our analysis again, i.e. we 
consider only the sub-ma trix i, j € [ 1 , n\ (similar in effect to 
iMandelbaum et ai1l2010l ). 
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Apart from the full model covariance C(M) of equa¬ 
tion [7] we also define the covariance without intrinsic vari¬ 
ations 

^noint _ £/obs + ^LSS _ ( 41 ) 

For a model independent representation, we calculate 
the ensemble covariance of the re-scaled profiles (cf. eon. 1341) 
as (7 mt (M). When calculating C lnt (M), we use 100 clusters 
that are nearest neighbours in a mass-ordered list of our 
simulated haloes to the target mass M and subtract the 
uncorrelated LSS covariance inside the simulation box. The 
empirically estimated covariance is 

C emp (M) = C ohs + C LSS + C int (M) . (42) 

5.1 Mass confidence intervals 

We determine the effect of intrinsic variations of the density 
profile on the validity of confidence intervals of weak lens- 
ing mass measurements. In this procedure, we assume the 
empirically estimated density profile covariance from our en¬ 
semble of simulated haloes to represent the true variability. 
We test for the effects of either taking into account intrinsic 
variations with the model proposed in this work or ignoring 
them. We design this test as follows. 

At given mass M , we define and determine the full 
empirically estimated covariance C emp (M) as described in 
eqn. Hi In a Monte Carlo simulation, we then generate pro¬ 
files by adding a multivariate Gaussian random vector ac¬ 
cording to C emp (M) to the model profile n(M) (eqn. [5]) of 
mass M Q 

Consider one such realization K. The likelihood in 
eqn. 1351 now interpreted as a function of mass C(M'\K), 
can then be run over a range of model profiles n(M') with 
different mass M'. For C(M') and we use 

• either the covariance without intrinsic variation (7 nomt 
of equation H 2 (colour coded red in the following figures) 

• or the covariance of equation [7] including our paramet¬ 
ric model for intrinsic variations (colour coded blue). 

Note that the latter option requires to include the lndetC 
term of the likelihood, since covariance is now a function of 
mass. 

We determine 68 % (and 90%) confidence intervals using 
mass limits where the likelihood is 2Aln£ = 1 (and 2.7) 
worse than at best fit. We repeat the procedure for 10,000 
realizations at every fixed mass and determine the empirical 
coverage, e.g. the fraction of cases in which the true mass 
lies inside these confidence intervals. 

Results for three different survey depths are shown in 

1 As a test for the validity of considering only multivariate Gaus¬ 
sian variations over the mean profile, we perform another run 
where instead of a synthetic variation we use 100 nearest neigh¬ 
bors in mass around M, re-scale their profile to M using the 
model profiles of eqn. [9] and subtract their mean to generate 100 
realistic random variations. We then add these (and multivariate 
Gaussian random vectors to simulate observational and uncor¬ 
related LSS covariance) to the model profile at M and run the 
likelihood as described below. Results in this approach do not 
differ significantly from the synthetic multivariate Gaussian ones 
presented here. 


Figure HI where we now compare results of the two methods 
(red and blue lines) to the target levels of 68 and 90% (black 
lines). Increasingly with increasing mass, ignoring intrinsic 
variations leads to an underestimation of errors. Even for 
present surveys with relatively shallow depth, this manifests 
in « 15% excess outliers at the one and two sigma level for 
massive clusters at around M 2 oom « 10 15 H~ 1 Mq. The effect 
is more severe in deeper data, where the relative importance 
of intrinsic over observational covariance increases. 

Using our model covariance, confidence intervals are 
correctly estimated over a wide range of masses and even 
in deep data. The small number of systems above Mzoom > 
10 15 /i _1 Mq, however, limits the range of masses where we 
can evaluate the validity of our model. Note in this context 
that the scatter in the plots is due to the noise in C" nt (M), 
as estimated on a limited number of clusters, rather than 
the number of realizations we run (the effect of the latter is 
indicated as the small vertical bar on the black lines). 


5.1.1 Effect of intrinsic covariance on mass uncertainty 

In the previous section, we found that the confidence inter¬ 
val based on a n covariance matrix without contributions 
from intrinsic profile variation is significantly too narrow, 
especially for more massive clusters and larger depth of the 
data. Here, we perform a Fisher analysis to determine how 
well masses can be measured, a question particularly rele¬ 
vant for the design of lensing follow-up programmes. 

For a mass dependent covarian ce matrix, the Fisher in¬ 
formation for mass T reads (e.g. lYogelev fe Szala^ 1 19961 : 
iTegmark et al.|[l997l i 



where k , C and their derivatives are taken to be evaluated 
at mass M. 

We calculate the Fisher information for the model with¬ 
out intrinsic variations (eqn. 1411) and our full model. In 
the first case, the second term of eqn. H 3 drops because of 
the mass independence of the covariance. Including intrinsic 
variations, we also evaluate the second term, which however 
contributes to the Fisher information at most at the per¬ 
cent level. The mass uncertainty ctm is related to the Fisher 
information as ct\m = J r ~ 1 ^ 2 . 

Figure [7] shows relative uncertainties in mass ctm/M due 
to a shear surveys of various depth. The increase in uncer¬ 
tainty due to intrinsic profile variation is substantial, even 
for shallow data with sufficiently massive clusters, but more 
strongly for deeper observations. Our results are consistent 
with the findings of lBecker fc Kravtsovl (l201lh , who used the 
same simulations to derive uncertainties empirically rather 
than with a covariance model. 


5.2 Mass-observable relations 

We study the effect of intrinsic variations of projected cluster 
density profiles on weak lensing follow-up surveys used for 
determining parameters of a mass-observable relation. Con¬ 
sider an observable Y (which could be the Compton decre¬ 
ment, an X-ray mass proxy or an optical richness) with a 
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Figure 4. Fraction of weak lensing measurements of cluster mass 
where the true mass is inside the 68% (/68, solid lines) and 90% 
(/go, dashed lines) confidence interval (black lines at target levels 
for reference). In red, thin lines we show the common case where 
only measurement noise and uncorrelated LSS are considered for 
the covariance matrix (eqn. 1411) . Blue, thick lines show results 
that include our model for the intrinsic covariance (eqn. . Pan¬ 
els correspond to typical large ground based-surveys (top), best 
available ground based data (centre) and space-based analyses 
(bottom). As data quality and cluster mass increase, intrinsic vari¬ 
ations become a significant component of the uncertainty budget 
of weak lensing mass measurements and should not be ignored. 
All plots are for lenses at zi = 0.24533. 



Figure 5. Relative uncertainty of weak lensing mass measure¬ 
ments from a Fisher analysis excluding (red) and including (blue) 
intrinsic covariance according to our model. Not including intrin¬ 
sic covariance leads to an underestimation of uncertainty, espe¬ 
cially at high mass and large depth. Notably, the improvement of 
uncertainty for massive clusters going from typical (10 arcmin - 2 
background galaxies, dotted line) to noiseless measurements (solid 
line) is only modest when intrinsic variations are considered. Re¬ 
sults are for lenses at zi = 0.24533. 


power-law mass-observable relation that describes the fidu¬ 
cial value Yq(M) as a function of mass M, 

\nYo(M)/Y = A + B\nM/M . (44) 

We have introduced here pivot values Y and M in addition 
to the power-law slope B and amplitude A. 

The observed value of Y for any cluster shall include a 
log-normal intrinsic scatter <Ji n t and, for simplicity, also log- 
normally distributed measurement related uncertainty cr 0 bs- 
The combined uncertainty a = of nt + cr^ hs leads to an 
observable 


In Y(M) = In Yq(M) + A/"(/x = 0, a) . (45) 


5.2.1 Likelihood 

Consider a single cluster with observed convergence profile 
K and measured observable Y. We maximize, as a function 
of MOR parameters, the likelihood 


P(K\Y, A, B, (Tint) 


= P(K\M)P(M\Y,A,B,a int )dM 


oc 


P(K\M)P(Y\M, A, B, <7 int )P(M) d M . (46) 


For P(K\M) we insert the expression of eqn.[3]with C equal 
to either our full model or the covariance without intrinsic 
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variations. The term for the Y likelihood can be written as 


P(Y\M, A, B, (Tint) 


exp[—(In Yq(M) - lny) 2 /(2a 2 )] 

V2a 

x e(Y-Y lila )P^l(M). (47) 


Here we have used the Heaviside step function 0 to im¬ 
pose the observable limit Yn m and re-normalized by the in¬ 
verse of the detection probability Pdet(M) = erfc[(lnYn m — 
In Yq{M))/(\ f2o-)\/2 to compen sate for the removed p art of 
the probability distribution (cf. IVikhlinin et al.ll2009al . their 
eqn. A10) and correct for the (Malmquist) bias due to pref¬ 
erential selection of objects with positive contribution from 
scatter. 

Finally, the normalized mass prior P(M) can be written 
for an observable limited survey as 

/ dM'^(M')Pdet(M') 

(48) 

to correct for (Eddington) bias due to the increase of abun¬ 
dance with decreasing mass. Note that P(M) depends on 
the MOR by means of Pdet and, by means of the halo mass 
function , also on cosmology (for which we, however, as¬ 
sume fixed values in the simulations presented here). 


H Ti 

p (M) « ^(M)P d et(M) 


5.2.2 Results 

We simulate samples of clusters 1 ^ ^ n, where cluster i is 

characterized by its observable Y, and an observed conver¬ 
gence profile Ki. 

For Y, we assume a MOR with B = 5/3 (the self-similar 
slope for Compton decrement Ysz and the X-ray equivalent 
Yx) and choose the pivots such that measurement errors 
on A and B are uncorrelated and A = 0. We assume an 
intrinsic scatter (Tint =0.1 and observational uncertainty of 
(T 0 bs — 0.05. 

The convergence profile K is simulated including a mul¬ 
tivariate Gaussian deviation from eqn. [21 according to the 
full covariance (including uncorrelated LSS, measurement 
uncertainty of the survey and intrinsic variation as predicted 
from our model) of eqn. [7] 

We take the sample to be observable limited, i.e. draw 
clusters with abundance proportional to the halo mass func¬ 
tion, assign observables according to eqns. !44[l45l and only ac¬ 
cept objects where the observable exceeds the survey thresh¬ 
old, Y > Yiim- The threshold Yn m is chosen as Yji m = 
Y 0 (M 200m = 4 x 10 14 /i -1 Mq), comparable for instance to 
ongoing SZ surveys. The number of clusters drawn is taken 
to be representative of a volume of V = 0.15h~ 3 Gpc 3 (co¬ 
moving, at z = 0.24533), which contains roughly 100 detec¬ 
tions in our simulated survey. 

Figure [S] shows the distribution of maximum likelihood 
estimates of the MOR parameters A, B and (Tint, with con¬ 
tours enclosing 68 and 95 per cent of the 15,000 realiza¬ 
tions, respectively. We find that the analysis using the full 
covariance (blue) reproduces the input MOR well. The fea¬ 
ture near (Tint = 0 is due to the hard prior (Tint > 0, which 
pushes random realization with lower empirical scatter to¬ 
wards this limit. The analysis using the covariance without 
intrinsic variations (red) significantly overestimates intrin¬ 
sic scatter. As an additional bias, the overall mass scale A 
(and, although less strongly, the slope B) are systematically 


underestimated. The move from a shallow (dotted) to deep 
(solid lines) surveys only moderately improves constraints 
and makes the bias due to using the covariance without in¬ 
trinsic variations more apparent. 

We note that the biases in A and B are due to a degen¬ 
eracy between them and the intrinsic scatter. If the latter 
could be constrained externally with small uncertainty, e.g. 
by means of realistic simulations or by combining several ob¬ 
servables with uncorrelated intrinsic scatter, the bias in the 
former would be mitigated even without a realistic model 
for the lensing covariance. 


6 CONCLUSIONS 

We have presented a model for the variation of projected 
density profiles of clusters of galaxies at fixed mass, con¬ 
structed as a combination of the effects of variations of halo 
concentration, ellipticity and orientation, and correlated sec¬ 
ondary structures. 

The full covariance including our model for intrinsic 
variations faithfully reproduces confidence intervals in the 
weak lensing likelihood of cluster mass. We show that when 
intrinsic variations are ignored, uncertainties in lensing- 
derived mass are underestimated significantly (cf. Fig. [J). 

Using the full covariance model we have made Fisher 
predictions for the accuracy of lensing measurements of clus¬ 
ter mass. We have shown that intrinsic variations take away 
some of the comparative advantage of studying a small sam¬ 
ple of the most massive clusters with the deepest possible 
observations. For a massive cluster ( M 2 oom ~ 10 15 h - 1 Mq 
at 2 = 0.25) we find an irreducible relative uncertainty in 
lensing mass of « 20 per-cent due to intrinsic profile vari¬ 
ations and uncorrelated LSS along the LOS, three times 
higher than the uncertainty from unco rrelated LSS alone. 
Our r esults agree with the analysis of iBecker fe Kravtsov! 
l| 201 llb 

With simulations of mock surveys for constraining clus¬ 
ter mass-observable relations with lensing data, we have 
shown that intrinsic variations significantly bias the derived 
intrinsic scatter and amplitude if they are not accounted for 
in the lensing mass likelihood. For a follow-up of a sample 
of 100 clusters selected by Ysz above the fiducial value of 
the observable at M 2 oom = 4 x 10 14 Y'Mq at a redshift 
2 = 0.25, the bias in the MOR amplitude is ss 15 per cent 
unless tight external constraints on the intrinsic scatter are 
available. Accounting for the cosmic variance of cluster lens¬ 
ing is therefore necessary for upcoming cluster surveys that 
target the calibration of MORs for cluster cosmology. 


ACKNOWLEDGEMENTS 

We thank Gary Bernstein, Eduardo Rozo and Peter Schnei¬ 
der for helpful discussions and Martin Kilbinger for support 
with the software NICAEA used in this work. 

This work was supported by SFB-Transregio 33 ’The 
Dark Universe’ by the Deutsche Forschungsgemeinschaft 
(DFG) and the DFG cluster of excellence ’Origin and Struc¬ 
ture of the Universe’. 


© 2014 RAS, MNRAS OOO.ITIIHI 












Cosmic variance of cluster weak leasing 13 



Figure 6. Distribution of maximum likelihood MOR parameters (true input indicated by black squares) estimated from a weak lensing 
follow-up of an observable limited sample. We show results for a shallow (dotted contours, n = 10 arcmin -2 ) and deeper (solid contours, 
n = 50 arcmin -2 ) lensing survey. Cluster profiles are simulated including intrinsic variation according to our model and the lensing 
likelihood for cluster mass is estimated including (blue, thick lines) and not including (red, thin lines) the intrinsic component of the 
covariance matrix in addition to shape noise and uncorrelated LSS. Inner and outer contours enclose 68% and 95% of data points, 
respectively. Results are for lenses at z; = 0.24533. 
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